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COUNTING PLANAR RANDOM WALK HOLES 

By Christian Benes 
Tufts University 

We study two variants of the notion of holes formed by planar 
simple random walk of time duration 2n and the areas associated 
with them. We prove in both cases that the number of holes of area 
greater than A{n), where {A{n)} is an increasing sequence, is, up to 
a logarithmic correction term, asymptotic to n ■ A{n)~^ for a range 
of large holes, thus confirming an observation by Mandelbrot. A con- 
sequence is that the largest hole has an area which is logarithmically 
asymptotic to n. We also discuss the different exponent of 5/3 ob- 
served by Mandelbrot for small holes. 

1. Introduction. The object of our study in this paper is planar simple 
random walk S defined by S{0) = (0, 0) and forn G N = {1, 2, . . .} by S{n) = 
J2^=i where {XjjjgN are independent random vectors satisfying f{Xj = 
zbcj) = 1/4, i = 1,2, where ei = (1,0) and 62 = (0, 1). We will also think of 
planar simple random walk S as being a continuous process, that is, for 
noninteger times t, we let S{t) be the linear interpolation of the walk's 
position at the surrounding integer times: For all real t>0, 

(1.1) s{t)=s{[t]) + {t-ms{[t] + i)-s{m, 

where [t] denotes the integer part of t. For any real numbers < a < 6, we will 
write S[a, b] := {S{t)}a<t<bj and use the same notation for Brownian motion 
B. Let the holes or components made by S'[0,2n] be the connected compo- 
nents of C\5'[0, 2n], where C denotes the complex plane, and the lattice holes 
made by 5[0, 2n] be the connected components of \ {5'(j)}jg|o 2„}. Two 
points z and w lie in a same component of I? \ {S{j)}j^s^Q 2n} if they can 
be joined by a nearest-neighbor path in that does not intersect 5[0,2?i]. 
The area of a hole is defined to be its Lebesgue measure and the lattice area 
of a lattice hole is its cardinality. See Figure 1. 
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Fig. 1. Two types of holes and area: here, area — 8, lattice area = 2. 

We define for any r G M+, n G N, 

"Hnir) = #{holes of area > r made by S[0, 2n]}, 

(1.2) Cn{r) = ^{lattice holes of lattice area > r made by S[0, 2n]}, 

Our main result is the following: 

Theorem 1.1. There exists a 5o> such that for all <5 <5o, 

(1.3) i^^$^iV„(5)^27r asn^oo, 

n° 

(1.4) ^-^^^Nn{6)^2TT asn^oo, 

n" 

P 

where denotes convergence in probability. 

We first outline the key ideas of the argument for (1.3), which is the part 
of Theorem 1.1 for which we give a full proof in this paper. The proof of 
(1.4) is practically identical and we will just mention in Section 5 which 
small modifications are needed to obtain it. 

1. Use Brownian scaling to extend the result in [10] (see the Appendix) for 
the number of components of C \ B[0, 1] of area greater than e > to 
the number of components of C \ B[0, n] of area greater than n^~^, 6 > 0. 
Then couple planar random walk S and planar Brownian motion B via 
Skorokhod embedding and compare, for some 6o > and all < 6 < 6q, 
the number of holes of area larger than n^~^ for B[0, n] and S[0, 2n] under 
the coupling, as described in the next steps. 
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2. Let c = 1 + e, where e > 0, and for j G {0, 1, 2, . . .},n G N, define 

(1.5) Ij,n = IjA^,e) = [ni-V,ni-V+i). 
This gives a decomposition of [n^^^,oo) =Uj^o 

Nj^n = ^j,n{S,£) = ^{components of C \ i?[0,n] with area in /j,n}, 
Nj^n = ^j,n{^, = #{components of C \ ^[0, 2n] with area in Ij^, 
and show that for every j >0, every e > smah enough, 

(1.6) F{\Nj^n — Nj,n\ > e-^j,n) ^ fast enough, as n ^ oo, 

so that the sum over j of these probabihties goes to 0, which then imphes 
F{\Nn{6) - Nn{6)\ > eNn{6)) 0, 

where 

Nn{S) = ^{connected components of C \ i?[0,n] of area > n^~^}. 

This is done by comparing the total area of all Brownian components 
with area in with the total area of all random walk components with 
area in Ij^n- 

3. To show (1.6), analyze A{z) = An{z) := ||C„(2;)| — |C„(z)|| for each z £ 
1? ^ where |C„(z)| is the area of the connected component of C \ i?[0,n] 
containing z, |Cn(2;)| is the area of the connected component of C\S'[0, 2n] 
containing z. More specifically, show that on "good configurations," if 
max{|C„(2;)|, |Cn(z)|} is large, A(z) is of smaller order of magnitude than 
max{|C„(z)|, 1^(2)1}. 

4. Show that "bad configurations" are unlikely. This involves: 

• Handling the case where z is close to dCn{z) or 9C„(z), which is done 
with the help of ideas relating the two-sided disconnection exponent 
for Brownian motion and random walk to the fractal dimension of the 
Brownian frontier. 

• Looking at other "bad cases" which can occur even if z is far from 
dCn{z) and dCn{z): z being disconnected from oo by shortly before 
time n, not leaving S enough time to disconnect z from oo as well 
(or the other way around); Cn{z) being a very thin component, and 
the same for Cn{z). The tools used for these cases are the one-sided 
disconnection exponent and Beurling estimates for Brownian motion 
and random walk. 

In Section 2, we give a list of definitions of the objects most commonly 
used throughout this paper and introduce Skorokhod embedding, the cou- 
pling which is at the center of our proof. In Section 3, we look at how "thick" 
the boundary of a Brownian motion or random walk component is. The ideas 



4 



C. BENES 



used are based on the method of [4] , which exhibits the relationship between 
the two-sided disconnection exponent of Brownian motion and the Haus- 
dorff dimension of the Brownian frontier. Section 4 contains a sequence of 
preparatory lemmas leading to a comparison between the areas of the Brow- 
nian motion and random walk component containing a given lattice point. 
In Section 5, we use the results of Sections 3 and 4 to prove Proposition 5.1 
from which Theorem 1.1 follows immediately. In Section 6, we discuss the 
initial motivation of our study, namely Mandelbrot's observation of a differ- 
ent exponent for small lattice holes, and give a precise conjecture for this 
observation. In particular, we show that if the picture suggested by Mandel- 
brot is the right one, the regime in which the 5/3 exponent exists must be 
very small. Finally, the Appendix provides the consequences of [10] needed 
to make the link between small components of C \ B[0, 1] and large compo- 
nents of C \ S'[0,2n]. Many basic estimates for planar Brownian motion and 
random walk are used throughout this paper, in particular large deviations 
and Beurling estimates. We will only include their statements in this paper, 
but their derivations, as well as a discussion on Skorokhod embedding, can 
be found in [1]. 

2. Definitions and tools. Throughout this paper, all multiplicative con- 
stants will be denoted by K,Ki or i^2- It will be understood that they 
may be different from one line to the next. The letters r,s,t will be used 
to denote real numbers, while i,j,k, 1, 111,71 will represent natural numbers. 
Points of the complex plane C will be represented by the letters u,v,w,z. 
The symbols O, o and ~ will have the usual meaning: for two functions 
/ and g, f{x) = 0{g{x)) if there exists a constant K such that f[x) < 
Kg{x) for ah x, f{x) = o{g{x)) if lim^^^oo f {x) / g{x) = 0, and f{x) ~ g{x) 
if \\m.x^oo f {x) / g{x) = 1. We say that a function f[x) decays rapidly if for 
every r S M, /(x) = o{x~'''). 

For any z , we define Sq(z) to be the closed region bounded by the 
square centered at z, whose sides are parallel to the axes and of length 1. 
For each z G Z^,Sq(2;) will be called a lattice square (or just square). The 
Euclidean norm of a point x in M or C is |x| and the integer part of x S ffi 
is [x]. The boundary of a set ^4 C C will be denoted by dA, its area by |^| 
and its diameter diam(A) = sup^ ^eA l^t' — -^l- The distance between two sets 
A, 5 e C is d{A, B) = \nix(zA,y(iB \x-y\. For any sets AdF, A" = F\A wih 
be the complement of A in F. It will always be clear from context what F 
is meant to be. 

Unless stated otherwise, B = {B{t))t>o will denote standard planar Brow- 
nian motion and S = («S'(n))„>o will stand for planar simple random walk. 
As a general rule, in this paper, a tilde will refer to a quantity related to 
random walk. S will also denote the continuous process {S{t))t>o obtained 



COUNTING PLANAR RANDOM WALK HOLES 



5 



from planar simple random walk as in (1.1). It will be clear from the choice 
of the letter for the argument if we consider real or integer times. 

At the center of the method used in this paper lies a coupling of planar 
random walk S and planar Brownian motion i?. It is a particular case of 
Skorokhod embedding. We state it here and refer the reader to [1] for more 
details. 

Proposition 2.1. There exist a probability space {0,,J^,¥) containing 
a standard Brownian motion B and simple random walk S in the plane, 
constants b,K > 0, such that 



From here on, we will be working in this probability space. For any t>0, 
we let 

(2.2) Ct{z) = the connected component of C\-B[0,t] containing z 
ii z€C\B[0,t] and 

Ct{z) = the connected component of C \ ^[O, 2t] containing z 

if ^GC\5[0,2t]. Uze B[0,t],welet ^(z) = and if z G S[0, 2t], Cj (z) = 0. 
We will use the convention that |0| =0. For any z gC and R G M+, 

(2.3) D{z,R)={w£C:\w- z\<R} 

will denote the closed disk of radius R, centered at z and D{R) will be short 
for D{0,R). 

Two estimates for Brownian motion and random walk will be useful in the 
proofs of the next two propositions, as well as later in this paper. They are 
large deviations estimates giving an upper bound for the probability that in 
time n random walk or Brownian motion travel much beyond distance ^/n 
or remain in a disk of radius much smaller than ^/n. For the proofs, see [1]. 

Lemma 2.1. If B is a planar Brownian motion, S a planar simple ran- 
dom walk, there exists a constant K > such that for every n>0, every 



(2.1) 




r > 1 



(2.4) 




(2.5) 




max 

0<fc<2n 
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Lemma 2.2. If B is a planar Brownian motion, S a planar simple ran- 
dom walk, there exists a constant K > such that for every n> 0, every 
r>l, 

P( sup \B{t)\<r-^^/n) <exp{-Kr'^}, 

\0<t<n J 

(2.6) 

f( max \S(k)\ <r-^^/n] Kexpi-Kr"^}. 

\0<fc<2n' ^ ^' - J ~ 

We will also need on several occasions the following well known result 
which we give here without a proof (see [1] for the continuous case and [2] 
for the more difficult discrete case): 

Theorem 2.1 (Beurling estimate). 

1. There exists a constant K > such that for any R>1, any x £ C with 
\x\ < R, any AcC with [0, R] C {\z\ : z £ A}, 

(2.7) r^{CR<TA)<K{\x\/R)'/^ 

where ^R = mi{t > 0: |-B(t)| > R} and = inf{t >0:B{t) G A}. 

2. There exists a constant K > such that for any 7i>l, any x £1? with 
\x\ < n, any connected set A<Zl? containing the origin and such that 
sup{|z| : z £ A} >n, 

¥^{^n<rA)<K{\x\/nfl\ 
where H„ = inf {A; > : 1 5(/c) | > ?i} and ta = inf { A; >{):S{k)£A}. 

3. Disconnection exponents and the holes' boundaries. Understanding 
the structure of the boundary of the components Cn{z) and Cn{z) containing 
a point z € C is essential in our approach to the problem discussed in this 
paper. We first derive in Proposition 3.1 an upper bound, uniform for all 
z G Z^, for the expected number of lattice squares (see Section 2) intersected 
by the boundary of Cn{z) or Cn{z). Proposition 3.2 then gives an upper 
bound for the expected number of lattice squares intersected by the union 
of the boundaries of all components of area greater than n^~^,6 > 0. 

The main tool needed for our estimate is the two-sided disconnection 
exponent, one of many intersection and disconnection exponents computed 
exactly in [7, 8] and [9]. Since we will make use of the one-sided disconnection 
exponent in Section 4, we mention it here as well. It was shown in [5] and [6] 
that each of these exponents is the same for Brownian motion and random 
walk. 

For any E C, we let be the probability measure associated 

with two independent planar Brownian motions B^ and B^ with B^{0) = x^ 
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and 5^(0) =x^. P^^ will denote the probability measure associated with 
the lone Brownian path started at x^. For i = l,2 and n G N, we define 
H^(x) = inf{t > 0: \B^(t) — x| > n}; if x = 0, we just write H^. 

Since we will always deal separately with Brownian motion or random 
walk in this section, we can use the same notation for S without risking any 
confusion: is the probability measure associated to two independent 

planar random walks and 5^ with S'^(O) = x^ and ^^(O) = x^, where 
x^jx"^ E and P^^ is the probability measure associated to started at 
x^. It will be clear from context whether P refers to Brownian motion or 
random walk. We also let ^^{x) = inf{/c > : \S^{k) — x\ > n}, and write 
Cn = ?n(0)) where i = 1, 2 and n G N. For any compact ^ C C, we let 

(3.1) Qi^) = the closure of the unbounded component of C \ A. 

3.1. One-sided disconnection exponent. Let 

An = {D{l)nQ{B^[O,Ei])^0} and = {{0,0) n Q{S^[0,^i]) ^ 0}, 

where D{1) is the closed unit disk centered at the origin, and write 

P(A„) = supP^(A„), P(i„) = P0(i„), 

where the sup is over all x with \x\ < 1. 

The following lemma is a consequence of [7, 8] and [9], where the value 
of the Brownian disconnection exponent is computed and [5], where the 
equality between the Brownian and the random walk exponents is shown. 

Lemma 3.1. There exists a constant K > such that for all n> 1; 

(a) F{An) < Kn-^l^, 

(b) P(in) <^n-^/^- 

3.2. Two-sided disconnection exponent. Let 

Fn = {D{1) n Q{B' [0, n] U ^^[0, n]) / 0}, 
Dn = {D{1) n Q{B'[0, Ei] U B^[0, H^]) / 0}, 
Fn = {(0, 0) n Q{S' [0, n] U S^[0, n]) / 0}, 

Dn = {(0, 0) n Qis' [0, u [0, fj) + 0}. 

We will write 

P(F„) = supP^'''^'(F„) and P(Z)„,) = sup P^'''^'' 

where the sup is over all < 1, < 1 and 

P(F„) = P°'°(F„) and P(D,) = P°'°(L)„). 

Lemma 3.2 is based on the same papers as Lemma 3.1, except that the 
equality between the Brownian and random walk exponents follows from [6]. 
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Lemma 3.2. There exists a constant K > such that for all n>l: 



3.3. Two estimates for the holes' boundaries. It was shown in [4] that 
there is a strong hnk between the two-sided disconnection exponent and 
the Hausdorff dimension of the Brownian frontier, defined as the boundary 
of the unbounded component of C \ B[0, 1]. This can be seen by observing 
that if < t < 1, B{t) is in the frontier of B[0, 1] if B[0,t] U B[t, 1] does not 
disconnect B{t) from infinity and that B[t, 1] and the time-reversal of -B[0,t], 
{B{t — s))o<s<t, are independent Brownian motions. [For two bounded sets 
A,B cC, we will say that A does not disconnect B from oo if BnQ{A) ^ 0.] 
The proofs of the next two propositions are based on this idea. 

In both propositions, the statements for random walk and Brownian mo- 
tion are the same, but the fact that Brownian motion has more freedom to 
wander than random walk in a unit time interval makes the proofs slightly 
more technical in the Brownian case. For the sake of variety, we prove the 
first for random walk and the second for Brownian motion. 

Proposition 3.1 gives an upper bound, uniform for all z ^1? ^ for the 
expected number of lattice squares which are intersected by the boundary 
of Cn{z) or Cn{z). Recall that a function is rapidly decaying if it goes to 
faster than any power function. 

Proposition 3.1. There exists a constant K > such that for every 
z G 1? , every n > 1, 



Proof. We prove (3.2). By Lemma 2.1, there is a rapidly decaying 
function (/>i such that 



(a) 
(b) 
(c) 
(d) 



IP(^„) <Kn-i/3, 



E ^ l{Sq(y)n5C„(z)/0} <i^n2/3(logn)^^/3 



and 



(3.2) 



E ^ l{Sq(y)n5C'„(z)/0} <Kn^l^i^ognf^l^ 



E 



l{Sq(y)n9C'„(z)/0} 



2n 



<Y,ns{j)^dCn{z)) 



j=0 
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2n 

= T.nsij)&dCnizy,sij) = w) 

w&? j=0 

2n 

< J2 T.^iSij)edCn{z);S{j)=w)+Mn), 

iue-D(\/nlogn.)i=0 

where for r G R+,D{r) = D{r) r\Z^ = {z ^I? : \z\ < r}. For all < j < 2n, 
we let 

sf\i) = S{3-i)-S{j), 0<i<j, 

Sf^ (i) = Sij + S{j), 0<i<2n-j, 

be the translates starting at the origin of the time-reversal of the part of S 
up to time j and the portion of the walk from time j to 2n, respectively. 
Then 5f ^ and sf are independent simple random walks, started at 0. 

By (3.3), we just need to find a bound for every z G Z^, w G D{^/nlogn), 
and every < j <2n, for 

(3.4) FiS{j)GdCnizy,Sij)=w). 

To ease the reader's work, we quickly outline the ideas involved in the 
simplest case, where \z\ > y/n\ogn and z is therefore very likely to be in 
Q{S[0, 2n]), where Q is as in (3.1). In this case, for a typical w and a typical 
walk S, the probability in (3.4) is bounded above by 

F{S{j) GQ{S[0,j]U S[j,2n]y,S{j) =w), 

which is equal to 

P(0 G Qisf [0, j] U Sf [0, 2n - j]); 5(i) = w). 

Unfortunately, the events {0 G QiS^'^^ [0, j] U 5^^^ [0, 2n - j])} and {S{j) = w} 

are not independent, since the first event influences the shape of /^^^^[O,^] 
which has an influence on the second event. If they were, the local central 
limit theorem (see [1]) and Lemma 2.1 would imply that for j > l,P(5(j) G 
dCn{z)] S{j) = w) < J^(min{j,n — ■ and summing this over 1 < 

j <n — 1 and w G D{y/nlogn) would give a bound of Kn^^^log^ n. It turns 
out that this heuristic argument works if we consider the slightly different 
events 

{0 G Qisf[0, [i/log^n]] U Sf[0, [(2n - j)/log'n]])} 

and {S{j) = w}, which are "almost" independent. Indeed, S^-^^ [0, [j'/log^ n]] 
is a much shorter path than S'j^^ [[j'/ log^ n], j] and its shape has little influ- 
ence on the position of S{j), which is mostly determined by Sj^^ [[j / log^ n] , j] . 
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The introduced logarithmic term causes almost no loss in the bound derived 
in the heuristic argument. For general z and more work is required, espe- 
cially since only the parts of S^^^ and S^^^ that lie in D{w^ \z — w\) determine 

whether w S dCniz) or not. 

We need to estimate the probability in (3.4) in a different way in each of 
the following four exhaustive cases: 

(i) 1^ — ■wp < j < n — j, 

(ii) \z — <n — i < 2^ 

(iii) J < min{n — j, \z — w^}, 

(iv) n — j < min{j, \z — 'w\'^}. 

We will consider cases (i) and (iii). Case (ii) [resp. (iv)] is handled exactly 
like that of (i) [resp. (iii)]. Note that j<n — j4^j< n/2. 

Case (i): For w,z€ 1? , we let T = T{n,w,z) = [2{\z - w\/log^nf] and 
define the event 

T = T{j,n,w, z) = < sup |S'-^^(i)| < |z — tt;|; sup |5-^^(i)| < — ^«| >. 

lo<j<T o<i<r J 

Note that by (2.5), 4>2{n) = P(T'^) decays rapidly and observe that if 5(j) = w, 

S{j) G dCn{z), and T occurs, then S^^^ [0, T] U S*]^^ [0, T] cannot disconnect 
from infinity. We define 

(3.5) D^AjB) = {A does not disconnect B from oo}, 

and will write V = V{sf[0,T]uSf\0,T],0). If |z - < j < n/2, then 

FiSij)€dCniz);Sij) = w) 

<¥iT')+F{{S{j)=w};V;T) 

< Mn) + ^F({5(j -T)=u; S{j) = w};V), 

where the sum is over all u £ D{w, \z — w\) = D{w, |z — ty|) H and (j)2{n) 
decays rapidly. 

The Markov property applied at time j — T and translation invariance of 
simple random walk can be used to see that this last expression is bounded 
above by 

^ P(S(j - T) = u)F{V- {S{T) = w-u}) 

< supP(S(j -T)=u)J2 ff'(^; {SiT) =w-u}) 

<supF{S{j -T)=u)F{V), 

where the sup and the sum are over the same set as above. Since T < 
j/log^n, the local central limit theorem implies that the sup is bounded 
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above by K/ j for some constant K > 0, independent of j and n. Using 
Lemma 3.2(c) to bound P(I'), we find that if \z — w\'^ < J < ''^/2, there exist 
a constant K > 0, independent of z,w,j and n, and a rapidly decaying 
function (p2 such that 

(3.6) r{S{j)edCnizy,S{j) = w)<-0-p^) '^\Mn). 

J V log^ n J 

Case (iii): U j < mm{n/2,\z - w\^},U = U{j,n) = [j^^], and 

U = h({j,n,w, z) = I sup |5'-"^^(i)| < |z — if;|; sup |S'-^^(i)| < — >, 
lo<i<u 0<i<U J 

then (j)z{n) = T{hl'^) decays rapidly. Using the notation defined in (3.5), we 
write 

V' = V{sf [0, U] U Sf [0, [/], 0). 

Under W,{S{j) G (9C„(z)} implies V . Therefore, using again the Markov 
property and the local central limit theorem, 

nS{i)(^dCn{z);S{j) = w) 

< J2 n{S{j) = w- S{j -U)= u};V') + Un) 

(3.7) 

< sup P(5(i -U) = u)- P(P') + (/.3(n) 



<-(i/log4n)-V3 + 03(„) = i^('_l 



i ° ' ■ / Vlog^n 

Here, the sum is over all u G D{w, ^l^^J^ ) and K is again independent of 
2, J, and n. 

We find in the same way as in case (i) that \i \z — w\'^ < n — j < n /2, 

(3.8) ^[S{j)(^dCn{z);S{j)=w)<-(\^^) ^'\un) 

3 \ log^ n J 

and in the same way as in case (iii) that if n — j < min{j, |z — tfp}, 

o-^ax -2/3 

(3.9) nS{j)(^dCn{z);S{j)=w)<K( \ ^' +<^5(n). 

V log n / 

We can now wrap up the proof by adding all our bounds: For a given z 
with |z — < n/2, we now obtain from (3.3), (3.6)-(3.9) that there exist 
a rapidly decaying function (/>(n) and a constant K such that for all z 
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and n G N, with the notation c=\z — w\'^ and D = D{y/n\ogn), 
E[#{yGZ2:Sq(y)n5a(^)/0}] 



2 + E 



c] / -2 \ -2/3 



log^ n 



n-[cAn/2] , i \ -2/3 n-1 // •^2\-2/3■ 



< 0(n) + K 



■ Fa /oi i V log^n J . ^ . , V log^n 

j=[cAn/2] ^ o j=n-[c] " 

/ W K2] I ,„|-2/3\ 

<<^(n) + Klog^/3n 5: 2 + 2^r'/3+ ^ ^^^^ 

wGD(Vnlogn) ^ i=l i=[cAn/2] / 

< + E:iog^/^n ^ 1^-^1-2/3(1 + log n) 

«'S-D(Y^logn) 

<Kn2/3(logn)"/3, □ 

For < (5 < 1, we define 

Cn = Cn(S) = {z^C:\Cn{,z)\>n^-^}, 

Cn = Cn{b) = {z^C:\Cn{z)\>n^-^}. 

In the proof of Proposition 5.1, we will need an estimate for the expected 
number of lattice squares (for the definition, see Section 2) which are inter- 
sected by the boundary of C.„(5) and C„((5). The ideas of the proof below 
are similar to those of Proposition 3.1, but the fact that we do not have 
to worry about the location of relatively to a point z makes this next 
proof considerably simpler. 

Proposition 3.2. There exists a constant K > such that for every 
S > 0, every n>l, 

(3.10) E[#{y G : Sq(y) n dCn{S) / 0}] < Kn^^+^^/^{lognf/'^ 
and 

(3.11) E[#{y G : Sq{y) D dCn{S) / 0}] < Kn^^+^^^^{\ognf/\ 

Proof. We prove (3.10) which presents a minor additional difficulty, 
since we have less control over the diameter of a path of time length one 
for Brownian motion than for random walk. The other ideas for the proof 
of (3.11) are the same as for (3.10). The general strategy of the proof is to 
find bounds for the expected number of time segments [j — over which 
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the Brownian path intersects dCn{S). This will suffice since the expected 
number of lattice squares intersected by B[j — is finite. For I < j <n, 
let dj = diam{B[j — and Bj = D{B{j),dj) be the closed disk of radius 
dj, centered at B{j), so that B[j — G Bj. We let n = [n"*^^*^ log"^ n] and 
define 



Aj = B[max{0, j - 1 - n}, j - 1] and Af = B[j, mm{2j + n}]. 

Aj' and Aj^ span time intervals of same length and that length is j — 1 if 
j <n + l and re if j > n + 1 . 

Our choice of re is motivated by the following: By Lemma 2.1, it is very 
likely that for every 1 < j < re, Aj U A^ is completely contained in a disk of 

radius More precisely, if V = {Af U Af C D{B{j),n^^-^^/^ /2)} , 

then P(V'^) = 0(re), where (p is a rapidly decaying function. On the event V, 
then, if B[j — intersects dCn{S), Aj LI Af cannot disconnect Bj from 
infinity. Indeed, if it did, Bj would be disconnected from infinity by a portion 
of the Brownian path entirely contained in a disk of radius re(^~^)/^/2 and 
could intersect the boundary of components of area no more than n^~^ . 

The rest of the argument just involves taking care of the case where 
dj is unusually large and applying Lemma 3.2(a). Let m = [re/2] + 1. By 
symmetry, we have 

E[#{yGZ2:Sq(y)n9C„/0}] 

(3.12) 

m 

= 2j2n*{y e : Sq(y) n dCn nB[j- 1, j] / 0}]. 
i=i 

We first look at the terms for 1 < j < re (in which case Aj and Aj^ each span 
a time interval of length j). There is a > such that 

E[#{y G Z2 : Sq(y) n dCn n B[j - / 0}] 

= J2 n#{y G : Sq(y) n 5C„ n B[j - / 0};dj G [/ - 1, 1]] 
l>i 

<kY, I'^HdCn n Bj / 0; dj G [/ - 1, /]) 
i>i 

(3.13) < K (^og^ n{F{{Bj G Q{Af U Afy,dj < log re}; V) + (/>(re)) 

/>logn / 




(>logn 
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where (f) decays rapidly and the last inequality follows from Brownian scaling 
and Lemma 2.1. Therefore, 

n 

^ E[#{y G Z2 : Sq(y) n 9C„ n S[j - 1, j] / 0}] 

n 

<i^(logn)«/3gr'/' 
In the same way, we find 

m 

^ E[#{y G Z2 : Sq(y) n 5C„ n - / 0}] 

■,=n+l 

< An log n — z — 
\ n J 

<ir7i(2+5)/3log8/3n, 
which, combined with (3.12) and (3.13), gives the proposition. □ 

4. Comparing Brownian and random walk areas. A key step in the proof 
of Theorem 1.1 is to show that if B and S are coupled as in Proposition 2.1, 
then under certain favorable conditions, the difference between the areas of 
the Brownian and the random walk components containing a given point 
z G Z^, A„(2;) = ||C„(2;)| — |C'„(z)||, has an expected value of smaller order 
of magnitude than the areas themselves. The conditions are that at least 
one of |Cn(z)| and |C'„(^;)| is > n^~* for some specific b G (0,1), and that 
z is not too close to dCn{z). This estimate is given in Proposition 4.1 and 
requires finding an upper bound, for fixed z, for the number of points y G 
with y G {Cn{z) \ Cn{z)) U {Cn{z) \ Cn{z)). Of course this number is only 
meaningful if |C„(z)| < oo and [(^^(z)! < oo and we will estimate E[A„(z)] 
with the assumption that this is the case. The points for which |Cn(z)| = oo 
or |C'„(z)| = oo will be dealt with in the proof of Proposition 5.1. A useful 
estimate for that purpose is derived in Lemma 4.3. 

If both components are finite, a point y is in Cn{z) \ Cn{z) if it has been 
disconnected from zhj S but not by B. Informally, this can happen if: 

1. Either y or z is closer to dCn{z) than the distance prescribed by the 
coupling in (2.1). 
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2. y is disconnected from z by 5 very late, that is, at a time close to 2n, in 
which case, B gets "very close" to disconnecting y from z, but may not 
have time to do so. 

3. B gets very close to disconnecting y from z, but does not, despite having 
plenty of time to do so. 

The main results needed to handle Case 1. were derived in the previ- 
ous section. We provide the estimates needed for Cases 2 and 3 in Lem- 
mas 4.5 and 4.4, respectively. These two lemmas are then used to prove 
Lemma 4.6, which gives a bound for the probability that y S {Cn{z)\Cn{z))U 
{Cn{z)\Cn{z)) if both components are bounded, one of them is large enough, 
and y,z are not too close to dCn{z) or dCn{z). 

Lemmas 4.1 and 4.2 give estimates similar to those in Lemmas 4.4 and 4.5, 
but are concerned with the first time a point z is disconnected from oo 
rather than from another point y. They are used to prove Lemma 4.3, which, 
as mentioned above, is used to deal with the case where |C„(z)| = oo or 

\Cn{z)\ =00. 

Note that Lemmas 4.1 and 4.4 are results about Brownian motion or 
random walk only, while the other lemmas of this section address questions 
about the joint behavior of the coupled random walk and Brownian motion. 
All the lemmas in this section have statements that come in pairs, where 
one version is obtained from the other by interchanging the roles of B and 
S. 

We will be interested in the first time at which an arbitrary point lies in 
a finite component and in studying how long it takes from a time at which 
it "almost" lies in a finite component until it actually does. This motivates 
the definition of the closing times for z by Brownian motion and random 
walk: 

r^ = inf{t>0:0<|Ct(z)| < cx)}, f^ = \ni{t>Q:{)<\Ct{z)\ <oo}, 

where Ct{z) [resp. Ct{z)\ is the connected component of C\i?[0,t] [resp. 
C\5[0, 2i]] containing z and |Cf(2;)| [resp. |C't(2:)|] was defined to be if 
z G -B[0,t] (resp. z e 5[0,2t]). We use here the convention that inf = oo. 
The closing points for z are 

X;, = B{T-^) and X:, = S{fz). 

For a curve 7: [a, 6] — > C and a point z ^ {7(t)}a<t<fei we let aig^{'^{t)) 
denote the continuous argument of 7 about z, with the convention that 
arg^(7(0)) = 0. This is well defined (see [13]). Note that aig^{-) is defined 
on the parametric interval [a, 6], not on the image of 7. The curves 7 with 
which we will be working are of course B and 5 and as the points z we 
are interested in will always lie off the paths of B and 5, the argument will 
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always be well defined. We assume henceforth that this is the case. We will 
use the abbreviation 



= inf{t > : 30 < s < t with B{s) = B{t), \ arg,{B{s), B{t))\ / 0}, 

and similarly for T^. Note that and are stopping times. The last call 
for z by B and the last call for z by S are, respectively, 



Observe the factors of 2 in the definition of Tj , which are due to the fact that 
in the Skorokhod embedding of Proposition 2.1, B and S run on different 
clocks. The last call points are x^, = B{Tl) and x\ = 5(2Tj). Note that by 
continuity of B and S, <Tz and < T^. 

The first lemma of this section shows that typically the closing time by B 
for any point comes "soon" after the last call. For such a point z, and any 
a > 1/2, this lemma will imply that if < n — n", it is unlikely that z will 
not be disconnected from infinity by i?[0,n]. Since if z is not too close to 
dCn{z) and B and S are close to each other, we have < T^, this means 
that {Cn{z) < oo; Cn{z) = oo} is an unlikely event unless Tz>n — that 
is, z is disconnected from oo by 5" "very late." This in turn will be shown 
to be unlikely in Lemma 4.2. 

Lemma 4.1. There exists a constant K > such that for any z G 7? , 
any n G N and any a> 1/2 ; 

(a) P(T^-Ti>n")<Kn(^-2°^)/24logn. 

(b) P(T^ -fi> n"") < Kn(i^2a)/24 ^^g^^ 

Proof, (a) To find a bound for P(r, - > n") we will need to consider 
all the relative positions of z and xf, = B{Tl). This will yield two different 
bounds. The first will be better for larger Iz — while the second will be 
better for smaller |z — x^]. 



arg^(7W,7('S)) =arg^(7(t)) -arg^(7(s)). 
This definition allows us to give another characterization of ■ 
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First suppose that m{z,a) := min{|z — x^^\,n'^^'^ / logn} > 300n^/'^ log^ n. 
Note that if n G D{z,m{z,a)/2Y [recall the definition of D in (2.3)] and 
< s < t are such that C D{u,m{z, a)/ 100), then, for all n>2, we 

have the obvious rough bound 

(4.1) |arg,(B(.),i3(t))|<^. 

We define T/ = inf{t > : | arg^(5(t), 5(rj))| > 37r/2, \B{t) - x[\ < Sn^/'^ x 
log^n}, x( = B{Ti), and = inf{t > Tl:B{t) e dDl), where Dl = D{xl, 
m{z, a) /lOO). The point x( can be thought of as lying "across from" xf, on 
B[0,Ti] (see Fi gure 2). We first note that the definition of x[ implies that 
the connected random set = B [T/ , contains xl , intersects dD-l , and 
satisfies: 

1. d{x[,A,) < 3ni/4iog2(n). 

2. For any t G [T/, | arg,(S(t), 5(ri))| > vr. 

The second point is true because \avg^{B{Tf),B{Tl))\ > 37r/2 and in- 
side D|, arg^(i?(t)) does not vary by more than ^, by (4.1), since xl G 
D{z,m{z,ay2Y. 

If we let ¥^ = mf{t > T] : B{t) G dD^}, then 

F(T, -Ti> n") < F{B[Tl,Ti + n''] n = 0) 
< ¥{B[Tl,Ti + n'^] n dD^ = 0) 
+ F{B[Ti,^[]nA, = 0). 



Therefore, using the fact that Tj is a stopping time, we can combine the fact 
that 




Fig. 2. Lemma 4.1: Ti is the first time at which z is "almost" disconnected from oo. 
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decays rapidly (2.6) and the Beurling estimate (2.7) to obtain 




We now derive a second bound in the case where m{z,a) = \z — x^J. If we 
let D = D{x[,n''/'^/logn), define E = inf{t > : B{t) £ D}, and use the 
definition of Q in (3.1), we find that since z G D{x^^,\z — x^|), there is a 
rapidly decaying function such that 



we have two bounds to choose from and find from (4.2) and (4.3) that 

(4.4) P(r, - rj > n"; m{z, a) = \z-x[\)< Kn(^-2a)/24 j^g^^ 
Also, if m(z,a) =?i"/^/logn, (4.2) yields 

(4.5) F{T, -Tl> n^-^miz, a) = n^'/^/logn) < Kn^^'^"^/^ log^/^ ^_ 
Combining (4.4) and (4.5) gives the lemma. □ 

There are obviously many times in the interval [0, n] at which new points 
become disconnected from infinity by B. However, for any given point z G Z^, 
the probability that z becomes disconnected from oo "late" in the interval 
[0,n] is small. 

Lemma 4.2. There exists a constant K > such that for any z G Z^, n G 
N and a > 1/2: 



P(r^ - > n") < F{z G Q{B[TIt[ + n"])) 

< nD{xl k - xil) n Q{B[TlTl + n^) / 0) 

< P(i?(xi, \z-x\\)r^ Q{B[Tm) / 0) + P(H 



(4.3) 





(a) P(T^ G [n - n", n]) < Kn^'^^^y^ log^°/3 n, 

(b) P(Ti G [n - n'', nj) < Kn^"-^)/^ log^°/3 n, 

(c) P(T^ G [n - n'^, nj) < Kn('^^i)/3 log^^/^ n, 

(d) P(ri G [n - n*^, nj) < Kn^'^^^)/^ iogio/3 ji. 
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Proof. We prove (b). The other cases use exactly the same ideas. For 
u) € C, let Dw = Dw{o-,n) := 4n"/^ log^ n) and recall the definitions 
made in the proof of the previous lemma: T/ = inf{t > : | avg^{B{t), B{TI))\ > 
37r/2, \B{t)-x[\ < 3n^/^log^n},x( = BiTi). Since a > 1/2, if G D{w,n''/'^), 
then xl E l)^. Therefore, using Lemma 2.1, 

F{Ti G [n-n'^,n]) 

(4.6) <J2^iTi G [n-n'',n];x[£D{w,n''/^)) 

= J2 G [n - n", n]; 4 G D{w, n^/^xl £ D^) + q){n), 
where the sum is over w G • Z^) n D{^/nlogn), cj) decays rapidly and 

. ^2 ^ {(x, y) G m2 . X = fcn"/^ y = /n"/^ A:, I G Z}. 
The definition of implies that if a > 1/2, then 

(4.7) {Ti G [n - n^n];xi G L'(i«, n"/^); g L*^} <Z {z i Q(B[0, T]] U L*^)}, 

where Q is defined as in (3.1). (See Figure 3.) Define $ = = inf{t > 
: B{t) G Dyo], r = r^,^ = inf{t > $ : \B{t) -w\ = \z- w\], and = F^ ,^ = 
sup{t < Tl : \B{t) -w\ = \z- w\}. The event 

{z i Q{B% Tl] U D^)} n {0(5[$, F] U B[r\ Tl]) n D^, = 0} 

is contained in 

{z ^ Q{B[0,Ti] u D^)} n {Q(B[o,ri]) nD^ = 0}c{z^ Q{B[0,Ti])}. 

But the continuity of the Brownian path implies that < T^, so this last 
event is the empty set. It follows from (4.7) that with the definition Az^w = 
Az,w{n) = {x[ G D{w,n''/^);x-f G D^}, 

{Tl G [n - n^ n];A,,^} C {g(B[$, F] U 5[F', T]]) n / 0; A,,^}. 

Therefore, (4.6) becomes 

P(rj G [n-n",n]) 

< ^ P({Q(5[$, F] U S[F^ Ti]) nb^y^ 0}; ^,,^) + (/.(n). 

The two events in this last probability are not independent, but, exactly as 
in Proposition 3.1, we can write the probability of their intersection as the 
product of their probabilities, up to a logarithmic correction term. Noting 
that -B[$, F] and the time-reversal of -B[F', Tj] are two independent Brownian 
paths from the inside of to D{w, \z — wl) and that by the local central 
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Fig. 3. Lemma 4.2: B[0,tI] UD disconnects z from infinity. The portion of the Brownian 
path inside D{w, \z — w\) cannot disconnect D from infinity. 

limit theorem (see [3]), P(x^ G D{w,n°-/'^)]Tl, >n-n"-)< Kn"''^, where K 
is uniform for all a > 1/2, we can use Lemma 3.2 to find that 

P(Ti G [n- n^n]) < ( JlZ^V'^Vs 

\fia/z log n/ 

where the sum is over w G (n'^/^ • Z^) n D{y/nlogn). This can easily be seen 
to be bounded above by ii'n^""^)/^ log^°/^ n. □ 

Lemma 4.3 shows that if B and S are coupled as in Proposition 2.1 the 
chance that C„(z) is finite while C„(z) is infinite is small when z is not too 
close to the boundary of Cn{z). The condition for z to be away from the 
boundary of Cn{z) is essential, since otherwise, we have no control over the 
probability that the Brownian path passes on one "side" of z and the random 
walk on the other. To avoid this, we will choose z to be at a distance from 
dCn{z) which is greater than the maximal distance between the coupled 
random walk and Brownian motion, that is, n^/^log^n. We define for z G C 
the events 

B(z) = {d{z, dCn{z)) > lOOn^/4 log^ n}, 

(4.8) 

B{z) = {d{z,dCn{z)) > lOOn^/^log^n}, 

for which the dependence on n should be noted but will not be written 
explicitly in order to keep the notation simple. As always d{-,-) denotes 
Euclidean distance. We also define 

r = Vn = l sup \B{t)-S{2t)\<n^/'^log^n\, 
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the condition that B and S are close to each other as in the Skorokhod 
embedding. By (2.1) P(P^) decays rapidly. 

Lemma 4.3. There exists a constant K > such that for every z ^l? 
and n G N.' 

(a) P({|C„(^)| < oo; \Cn{z)\ = ^];B{z)-V) < i^n^ 1/30 i^gio/s ^_ 

(b) P({|C'„(z)| <oo;|C„(z)| = oo};^(z);P)< il'n-i/30iogio/3^_ 

Proof (See Figure 4). (a) The idea of the proof is that on the event 
B{z), at the instant z is disconnected from infinity by B, Tj, the last call for 
z hy S has already occurred. At that instant, either the time is very close 
to n, which is unlikely by Lemma 4.2(a), or the random walk has plenty of 
time to disconnect z from infinity and will do it with high probability by 
Lemma 4.1(b). 

We use the fact that for n large enough, B{z) nV C {Tj < T^}, which 
follows from an argument similar to the one we used at the beginning 
of the proof of Lemma 4.1: By definition of T^, there is a time T < 
such that B{T) = B{T^) and | aig^{B(T),B{T^))\ > 2-k. The conditions B{z) 
and V ensure that |arg^(S(r)) - arg^(5(2r))| < f and |arg^(5(r^)) - 
arg^(S'(2r2))| < f , so that 

(4.9) |arg,(5(2r),5(2r,))|>-. 

Also, the condition V guarantees that \S{2T) — B{T)\ < n^/^log^n and 
\S{2T^) - B{T^)\ < n^/^log^?!, which implies 

(4.10) \S{2T) - S{2T,)\ < 2n^l^\oin. 

Random walk 




Fig. 4. Proof of Lemma 4.3: when Brownian motion disconnects z from oo, random walk 
is close to doing so as well. 
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From the definition of Tj, (4.9) and (4.10), it is now clear that < Tj. 
Therefore, we have, for any z G C and n large enough, 

¥{{\Cn{z)\ < oo; \Cn{z)\ = oo};B{z);r) 

<F{\Cn{z)\=oo;fi<n-n^/^°)+F{T, G [n-n^A^n]) 
< F{f, -fl> n^/iO) +P(T^ G [n - n^/^^n]) 

by Lemmas 4.1(b) and 4.2(a). Part (b) is done in the same way, but we use 
Lemmas 4.1(a) and 4.2(c). □ 

The next two preparatory lemmas give estimates similar to those in Lem- 
mas 4.1 and 4.2, and will be used to prove Lemma 4.6, where we show that 
for points y and z that are neither too close to the boundary of Cn{z) nor 
to each other, it is unlikely that y ^ Cn{z) if y G Cn{z). 

We introduce stopping times reminiscent of those defined at the beginning 
of this section but which are concerned with the time at which points y and 
z first lie (or "almost" lie) in different components of the complement of the 
Brownian or random walk path. If y, z G C, 

Ty^^ = inf{t > : 30 < s < t with B{s) = B{t), 

|arg^(B(s),i?(t))|/|arg,(B(s),i?(t))|}, 
fy^^ = inf{t > : 30 < r < t with S{r) = S{t), 

|arg^(5(r),5(0)|/|arg,(5(r),5(i))|}, 
T^z = Ty,z{n) = inf{i > : 30 < s < t with d{B{s),B{t)) < 3n^/^ log^(n), 

I aTgy{B{s),B{t)) - arg,{B{s),Bm > 37r/2}, 

tI,z = tIA^) = inf{t > : 30 < r < t with d{S{r), S{t)) < 3?i^/^ log2(n), 

I a.igy{B{s),B{t)) - arg,(B(s),5(t))| > 37r/2}. 

We now give the analogues of Lemmas 4.1 and 4.2 for the quantities 
we just defined. As the arguments are the same, we only indicate which 
modifications from the proofs of Lemmas 4.1 and 4.2 are needed. 

Lemma 4.4. There exists a constant K > such that for any y,z £ 1? , 
any n > 1 and any a> 1/2: 

(a) F{Ty,z - Tl -^ > n'') < /^(n^i-^'^)/^^ bgn + \y - z\-'^/^n^/'^^logn) . 

(b) P(T;,^ - f^, > n") < K(n(i-2a)/24iQg^_^ |y _ z\~^/^n^/'^^logn). 
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Proof. The proof is essentially the same as that of Lemma 4.1. The 
only difference is that we need to take into account the relative positions of 
y and .,, as well as those of z and .,. We define m{y,z,a) = min{|z — 
Xy^J, \y — Xy^^|,n"/^/logn}. By symmetry, we can assume \z — Xy^J < |y — 
x^yj. We then find that if m{y,z,a) > lOOn^/^ log^ n, then ^{Ty^:, - T^^ > 
71°") < Kn^/^{m{y, z,a))^^^'^logn and if m{y,z,a) = \z — Xy^J, then 

min{|j/ — x^yJ,n'^/'^/logn}J 

Looking separately at the cases \y — Xy J < n"/^/logn and n"/^/logn < 
\y — Xy^^\, we find that 

Since we assumed that \z — Xy^J < |y — Xy^^l-: we have \y — Xy^J >\y — z\/2, 
which concludes the proof. □ 



Lemma 4.5. There exists a constant K > such that for any y,z £ 
Z^,n£ N and a> 1/2: 

(a) F{Ty^, e[n- n", n]) < Kn^''~^^l^ log^^/^ n, 

(b) P(r^^^ G [n - n", n]) < Kn^''-^)/^ log^^/^ n, 

(c) P(Ty,^ E [n - n'', n\) < Kn^''~^^l^ log^^/^ n, 

(d) P(f^^^ G [n - n*^, n]) < J^n('*^i)/3 logH/s 

Proof. The proof is virtually the same as that of Lemma 4.2 except the 
definitions of F and F' must now be F = '^y^z,w = inf{t > '■ \B{t) — w\ = 
min{|z — w\,\y — w\}}, and F' = F^ = sup{t < T] : \B{t) — w\ = min{|z — 

w\^y-w\}}■ n 



Recall the definitions made in (4.8). 

Lemma 4.6. For any given y,z ^ 1? : 

(a) P({y G Cn{z) \ Cn{z)};B{z)-B{y)-V) 

< Kmax{n~^/^'^ log^ n, \y — z|~^/^n-'^/^'*logn}. 

(b) P({y G Cn{z) \ Cn{z)};B{zy,B{y);V) 

< Kmax{n~^/^'^ log^ n, \y — z|~^/^n-'^/^^logn}. 



Proof, (a) Note that y and z lie in different components of C \ i?[0,t] 
if and only if t > Ty^^- Exactly as in Lemma 4.3, {B{z);B{y);V} C {Ty ., < 
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Ty^z}, and so, for a > 1/2, 

F{{yeCniz)\Cn{z)};Bizy,Biyy,V) 

< P(y G Cniz);Tl, < n - n") + P(4, G [71 - n^ n]). 

The result now follows from Lemmas 4.4(a) and 4.5(c). 

The proof of (b) is the same but uses Lemmas 4.4(b) and 4.5(a). □ 

We are now ready to attack the core of the argument. The following propo- 
sition shows that under the coupling, if C„(z) and Cn{z) are both finite, one 
of them is "large enough," and z is not too close to their boundaries, then the 
difference between the areas of Cn{z) and C„(z) is usually small, relatively 
to these areas. We define 

/^{z) = \\Cn{z)\-\Cn{z)\\, 

the difference in area between the Brownian motion and random walk hole 
containing a given point z ^H? and 

e{z) = {\Cn{z)\ < 00; \Cn{z)\ < 00}, 

the condition that both the Brownian and random walk components con- 
taining z are finite. Recah that V = {supo<t<„ \B{t) - S{2t)\ < n^/^log^n} 
and B{z) = {d{z,dCn(z)) > lOOn^Mog^i}. 

Proposition 4.1. For every Ki > 0, there exists a constant K2 > 
such that for any 2; G C, n G N and < 6 < 1/30, 

¥{{A{z) > Kin^-''};B{zy,£{zy,V) < K2n''-^/^'^log^ n. 

Proof. We will prove the proposition under the additional assumption 
that 

M=\ sup \B{t)\ < y/nlogn\ 

is satisfied, which will suffice since ¥{J\f^) decays rapidly by Lemma 2.1. We 
will use the abbreviation 

g = g{z, n) := B{z) n £{z) nvnAf 

and show that 

E[A(z);g] <i^n^-^/3°log^n. 
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Once we have this, the proposition follows from Chebyshev's inequality. Note 
that 



(4.11) 



A{z) < #{y G Z2 : Sq(y) n 5C„(z) / 0} 

+ #{yeZ^:Sq{y)ndCn{z)^0} 

+ #{yeZ^:y€Cn{z)\Cn{z)} 
+ #{yeZ^:y€Cn{z)\Cn{z)}. 



The first two terms on the right-hand side are related to the Hausdorff di- 
mension of the Brownian frontier and were dealt with in Section 3. If we 
modify (4.11) slightly and "thicken" the boundary, it follows from Proposi- 
tion 3.1 that 

E[A{z);g] < E[#{y G : d{y, dCn{z)) < lOOn^/^ log^ n}] 

+ E[#{y G Z2 : d{y, dCn{z)) < lOOn^^ log2 n}] 



(4.12) 



+ E ^ l{yeCn{z)\C^{z);g;B{y)} 
+ l{y^Cn{z)\Cn{z);g;B{y)} 



<Kn^^'^\\ognf"^+ ^ n{y^Cn{z)\Cn{z)y,G;B{y)) 
+ n{y^Cn{z)\Cn{z)};g;B{y)). 

By Lemma 4.6(a) and the definition of Q, 
Y,ny^Cn{z)\Cn{zy,g;B{y)} 

< Yl n{y(^Cn{^)\Cn{z)y,B{z);Biy);V) 

2/GD(v^logn)nZ2 



<K ^ 1+ Y nV24|y|-l/6log 

|j/|<ni/'l nl/4<|y|<„9/20 

+ Y n-i/30log2n 

n9/20<|«|<ni/2Wn 



n 
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and the same bound holds for the second sum in (4.12). Thus, 

E[A(z);g] <i^nl^/12(iog^)17/3^^^1-l/30iQg4^<^^l-l/30iQg4^_ □ 

5. Main results. 

Proposition 5.1. There exists a probability space (r2,jr,P) containing 
a planar simple random walk S and a planar standard Brownian motion B 
such that, if for 5 € M, 

Nn{5) = ^^{connected components of C \ B[0,n] of area >n^~^}, 

-^n('5) = i^{connected components of C \ S[0,2n] of area >n^~^}, 

]^n{S) = ^{connected components of 1? \ S'[0,2n] of cardinality > n^~^}, 

then for every e > 0, every < 5 < 1/60, 

(5.1) ¥{\Nn{6)-Nn{S)\>eNn{6))^0 asn^oo 
and 

(5.2) ¥{\Nn{5)-Nn{5)\>eNn{5))^0 asn^oo. 

Remark. We wih only prove (5.1), as the proof of (5.2) is virtually the 
same. The reason why the same proof works is that what would cause the 
random walk hole containing a point z to have an area that is substantially 
different from the lattice area of the random walk lattice hole containing that 
point is their boundary behavior and that boundary effect is eliminated in 
Proposition 4.1 using the work done in Section 3. 

Before proving (5.1), we introduce the notation used in this section, as 
well as the extensions of [10] which are essential to our proof. Although most 
of the quantities we are about to define depend on 6, we will usually let this 
dependence be implicit to keep the notation as light as possible: 

^[a,b) = ^[a,b){n') = #{conn. components of C \ i?[0,ri] with area in [a,b)}, 

^[a,b) = #{conn. components of C \ S[0, 2n] with area in [a, b)}, 

In = In{6) = [n^^^ oo), Nn{S) = Ni,^, Nn{6) = Ni„. 

We let 

c = c{e) = 1 + e 

and define for all j > — 1, 
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Fig. 5. T/ie intervals needed in the proof of Proposition 5.1. 



for all j > 0, 



ni^V,n^-V(l+e")), 



See Figures 5 and 6. The number of components in the corresponding inter- 
vals will be 



(5.3) 



L Ar _ i\tR 



1^ ) 



J," 



We let 

Zj^^ = the set of all components of C\B[0,n] with area in I^^, 

Zj^n = the set of all components of C \ ^[0, 2n] with area in 

^j,n = the set of all components of C \ i3[0,n]with area in Ij^n- 

The details on the results which we state now and which follow from [10], 
can be found in the Appendix. If we let 



log2(n^)' 

then for every K > 0, 

(5.4) r{\Nn{S)-^n\>Kjn)^0 
In particular, for every 5 > 0, Ki < 1 < K2, 

(5.5) lim ¥{Nn{8) < Kijn) = and lim F{Nn{S) > i^27n) = 0. 



as n ^ 00. 



Fig. 6. Splitting [n^ *,c>o) into small finite intervals. 
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We define 



27rnMogc _ 27rn'^ log(l + e^^ 



ci log^ I (Trn^) ' ^'"^ log^ / (vrn'' ) 
(5.6) 

27m^ logc/(l + £2)2 



7o 



and point out tliat for every 6 > 0, there exists a K = K[6) > sucli that 

rAlogni 
^ 2 log c J 



for ah 7i > 1, e > and < i < [^^1 



(5-7) |7/;^-e7,>l<i^'e\n- 

Then (A. 5) and (A. 6) imply that for every Ki > and 5 > 0, there is a 
constant K2 = K2{Ki,S) > such that for all n large enough, e > small 
enough, and ah 0<j<[|i^], 

(5.8) P(|iV,- „ - 7j- „| > Kaj,n) < K2 log-3/2 

and the same inequality holds if we replace the pair {Nj^n, li,n) by {Nf'^n^ '^f,n)^ 
{Np^,^j"J^), or (-/V~„,7~^). We are now ready to prove the proposition: 

Proof of Proposition 5.1. We wiU show that for all < 6 < 1/60 
and e > 0, 

F{\Nn{6) - Nn{6)\ > lleiV„(<5)) ^0 as n ^ 00. 

We first note that "most" of the Brownian holes of area greater than n^^^ 
have an area which is "close to" n^~^ . For instance, as can be seen from 
(5.4), the number of holes for B with area in the interval [n^~'^, n^"'^/^] 
is typically of greater order of magnitude than the number of Brownian 
holes with area in the interval Recall that c = 1 + e. If we let 

m = m{n,e,6) = [^^], then n^-'^c™ < n^^^/^ < ^i-5^m,+i 

F{\Nn{6)-Nn{6)\>neNn{6)) 

(5.9) < P(|iV[„i-*_„i-.,,„) - N[^^-s^^i-s,m)\ > WeNnid)) 

The last term of (5.9) goes to as n goes to 00. Indeed, if |-^[ni-'5 0^,00) ~ 
A^[„i-ic"i,oo)l > ^-^n(^), then either N^n^-s^m^^^ > eNn{d) or ^^[ni-ac^.oo) > 
eNn{d). We also know from (5.5) that ¥{Nn{5) < j^^) = o(l). By observ- 
ing that if -/V[ni-'5c'",oo) > ^ iog"n'^ ' total area enclosed by B is greater 
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than g 2"^+'^/^ 1^ ^j^^ same holds for 5, we can conclude that 

IP(l%i-*c-,oo) - ^[ni-*-c'",oo)l > £Nn{6)) 

<P sup \B(t)\>E^^^ ^ 

/ 2n(^+^/2)/2N 
+ P sup |5(2t)| ^ +o(l)^0 asn^oo, 

by (2.4) and (2.5). It now suffices to show that 

lirn P(|iV[„ I > 10eiV„(5)) =0. 

Observe that 

IP(l^[ni-«,nl-'c'") - ^[nl-'.nl-^c'")! > 10eiV„((5)) 

Cm— 1 m—1 \ 

^ |iV,-„-iV,-„|>10e^iV,-„ 
j=o i=o / 

m—1 

< 5] P(|iV,-,-iV,-„| >10e7V,-„). 
i=o 

Because of the definition of m, it now suffices to prove that for every e > 
0, < (5 < there is a > such that for all < j < m — 1, 

(5.10) P(|iVj- „ - iV,- „| > lOeiVj, „) < i^^(n), 

where 'i/'('^) = o(log~"'^n). Recall that we defined 

Zj^n = {connected components of C \ 5[0, 2n] with area in /j,n}, 

Zj^n = {connected components of C \ -B[0,n] with area in /j,n.}- 

Unfortunately, we cannot use Proposition 4.1 to show (5.10) quite yet. The 
problem is that the fact that \Nj^n — -^j,n| > 10eA'j-,„, does not imply anything 
about \\Zj^n\ — \Zj,n\\ and so estimates about the latter cannot be used to 
show (5.10). To make things work, we need the interval containing the areas 
of random walk holes to be strictly included in the interval containing the 
areas of Brownian motion holes (or vice versa). The key in creating such a 
situation is to observe that 

P(|%„-A^j-„| >10eiVj,„) 

, , < m^.n < (1 - 2e)N-^) + P(iV,- „ > (1 + 2e)iV+J 

(5.11) 

+ P(iV^„ > 4e7VrJ + P(Ar^„ > 4eiV-J 

+ P(iV/+i > 7/2eiV,- „) + P(Ar/^i > 7/2eN,^n)- 
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In words, if iVj.„ is much greater than Nj^n, then either it is somewhat 
greater than N^^, or there are many Brownian holes with area in /^"^ \ Ij n- 
This scheme wih work because we have defined in such a way that the 
Lebesgue measure of \ Ij^n is of a smaher order of magnitude than the 
Lebesgue measure of Ij^n- 

P{iV^„>4e7Vrj 

+ P(A^„ > §7];^) + mi,n < h7,n) 

We know from (5.7) that if e is small enough, |7j'^ < 2ejJ^, so the first 
term is 0. The second and the third are ©((logn)"^/^), uniformly for < 
J < m — 1, by (5.8), and the last three terms of (5.11) can be bounded in 
the same way. Therefore, 

P(|iVj-„-iV,-„| >10eA^,-„) 

(5.12) <P(%„<(l-2e)iVrJ 

+ P(iV,-„ > (1 + 2e)N+J + 0((logn)-3/2). 



We define 



To bound the first term on the right-hand side of (5.12), note that 
\ZrJ > Nr^cin^'^l + e^) and „| < iVj, ^c^'+^n^-^ 

so that if Nj^n < (1 - '^^)^i,n^ then 

AT > Nry~^c>[l + £2 _ (1 _ 2e)c\ = N-y-^c^{e + 3e^). 

Since iVr„ E N and we know from (5.8) that P(iVj"„ = 0) < if (log n)"^/^ 
we see that 

(5.13) P(iV,, „ < (1 - 2e)iV-„; A" < en^-^) < K{logn)-'^/\ 

The obvious inequality \A\ — \B\ < \ A\B\ for sets A,B£C and (5.13) imply 
that 

P(iV,- „ < (1 - 2e)iV-J < P(|Z-„ \ > en'-') + 0((logn)-3/2). 

But 
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where the union is over the union of {z G Z2 : d{z, dZj-J < log^ n} 

and {z G Z2:|C„(z)| G /".HCnl^)! ^ Ij,n;d{z,dZ-J >'l00ni/4 log2 n}. We 
can now combine the different pieces of our work and write 

< E ^ l{d{z, dZjJ < 100?i^/^ log2 n} 

-\z\<Vnlogn 

+ n{\Cn{z)\€q^;\Cn{z)\^I,,n};B{zy,V) 

|z|<v^logn 

< Kn(33+'5)/36 iogi4/3 + J2 F{{A{z)>e^n^^^};B{z);£{z);V) 

\z\<\/nlogn 

+ n{\Cn{z)\ G \Cn{z)\ = oo};B{z);V) 

\z\<y/n\ogn 

< K(n"/12+<5/36 iQgl4/3 ^ ^ ^1+5-1/30 j^gG ^ ^ ^1-1/30 i^gl6/3 ^) 
<^„l+5-l/30i^g6^^ 

by Propositions 3.2 and 4.1 and Lemma 4.3. The sums are over elements of 
and the constant K may depend on e. Therefore, 

F(|Zr„\4„|>eni-^) 

For every e > and < 6 < 1/60, this decays faster than (logn)~3/2_ xhe 
second term of (5.12) is bounded in the same way. It now suffices to look 
back at (5.10) to see that the proof is complete. □ 

Given this proposition, the proof of Theorem 1.1 is straightforward: 

P(|iV„(d)-27r7„| >e7„) 

< pf |iV„(<^) - Nn{6)\ > ^-fn) +F(\Nn{6) - 27r7„| > J7„, 



<P( |A^„(5) -iV„(5)| > -iV„(<5) ) +P(7„ < Af„(5)/2) 



'(^|A^„(<5)-27r7„,|>|7„). 
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From Proposition 5.1, (5.5) and (5.4), we know that for every e > 0, every 
< S < 1/60, each of the 3 terms goes to as n goes to infinity. The same 
holds if we replace Nn{S) by N, which concludes the proof of the theorem. 

Remark. The statement of Theorem 1.1 can be written in the following 
equivalent form: If A{n) = n^~^,0 < 6 < 6o, then with the notation defined 
in (1.2), 

(5.14) log^(^M(n)) P^^ ^^^^^ 
^ ^ n/A{n) '^v \ 

(5.15) / \, \ LniAin))^^!: asn^oo. 
nj Ayn) 

A closer look at the proof shows that in fact, (5.14) and (5.15) hold for 
arbitrary A(n)^ as long as n^""^" < A(n) and A{n) = o(n). A straightforward 
consequence of this and Lemma 2.1 is that if ■n^^^^'' < A{n) and A{n) = o{n), 
then 

logE[W„(A(n))]-log(n/A(n)) 

and 

logE[£n{A{n))]r^log{n/A{n)). 



6. Small holes. Our interest in this problem was spurred by the obser- 
vation by Mandelbrot (see [11]), based on computer simulations, that while 
the number of random walk holes and the number of large lattice holes are 
governed by a power function with exponent 2, the exponent for small lattice 
holes is 5/3. 

The remark at the end of Section 5 shows that if A{n) > n^~^ and A{n) = 
o{n), then logE[£„(A(n))] ~ \og{nA{n)-^l'^), with ^ = 2. The number ^ is 
what Mandelbrot calls the exponent. 

A formal definition can be made as follows: Given a sequence of inter- 
vals {/„}„gM) a sequence of functions {/„}:/,!— >M has exponent ^ over 
the intervals J„ C In if for any two sequences {A{n)}ri&n-,{A' {n)}n<^n with 
A{n),A'{n) £ J„, 

(6.1) log(/„(A(n))//„(^'(n))) ~ -| log(A(n)/A'(n)). 

With the notation above, Mandelbrot's observation can be described as fol- 
lows: 

There exist sequences Aq,Ai, and A2 with logyl2(n) ~ logn, such that: 

• E[>C„(-)] has exponent 5/3 over [l,ylo('^)]- 

• E[£„(-)] has exponent strictly between 5/3 and 2 over [AQ{n), Ai{n)]. 
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logE[£„(^(n))] 
log 



fto(") := log j4o{ri) 
ai(ii) := log Ai(fi) 



ftn'{")Ai(n,) 




log I! 



'\ogA{n) 



Fig. 7. Two regimes. The log-log plot yields a slope of —1 for large holes and —5/6 for 
small holes, corresponding to exponents of 2 and 5/3, respectively. 

• E[£„(-)] has exponent 2 over [Ai{n),A2{n)\. 

As mentioned above, Theorem 1.1 imphes that there is a (5o > such that 
the third point is true with Ai{n) = n^""^" and any A2{n) satisfying A2{n) = 
o(?i), log ^2(77.) ~ logn, and the same holds if E[£„(-)] is replaced by E['H„(-)]. 

It turns out we can say much more about Ai if the situation is exactly as 
described above in the three points. Assuming that situation, it is easy to see 
that if Ai{n) = for some e > 0, then lim„_>oo log E[/^fi(l)] / logn < 1 [see 
Figure 7 and recall that is the number of lattice holes of lattice area 

> 1]. We will see in the lemma below that in fact, logE[£„(l)] ~ logn, thus 
showing that in the situation described above, Ai (n) would necessarily grow 
more slowly than any power function. This would imply that logAi(n) = 
o(logn), so that the region in Figure 7 over which the exponent 5/3 may 
be observed would become a smaller and smaller fraction of the domain of 
logE[£„(-)], as n— > 00. The same is true if we replace E[£„(-)] by E['H,„(-)]. 

We devote the remainder of this section to proving that logE[£„(l)] ~ 
logn and logE[7Yn(l)] ~ logn, and conclude by stating a conjecture based 
on Mandelbrot's picture and the results derived in this paper. 

We say that two (lattice) holes made by S'[0, 2n], H and H' , are equivalent 
if there exists a z S such that H' = H + z. We choose a representative 
H for each equivalence class by requesting that the first point of dH in the 
lexicographic order be the origin and call the set of these representatives 
TZi for holes and 7^2 for lattice holes. Given a representative H and a point 
z G Z^, we define H:,:=H + z. We will write Sz=HzLI dH;, and Tz{H) = 
mf{k > : S{k) G H^}. For each H in TZi or 7^2, we define Nh = #{z G : 
Hz is a (lattice) hole made by 5[0, 2n]}. 
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Lemma 6.1. For any given hole H £ IZi or lattice hole H £ IZ2, there 
exists a K >Q such that for all n> 1, 

n 



E[Nh] > K 



log n 



Proof. We give the proof for H eTZi. The case where H eTZ2 is done 
in the same way. Suppose the boundary of H is composed of 2k line segments 
of length 1 and that n>8k. Then E[A'^j:/] is equal to 

^ F{H, is a hole for 5[0,2n]) 

2n-2fc 

= ^ ^ F{H, is a hole for S[0,2n];T,{H) = I) 

z&? 1=0 
[3n/2] 

> E E nS[O,l-l]nH, = 0; 

2gZ2 l=[n/2] 

S[l, l + 2k]= dH,;S[l + 2k, 2n] nH, = 0) 
>K min W""{S\lA'inl2V\r\H = 0)n 

X min P"'(5'[0, [3n/2]] ni? = 0), 

where the last inequality is obtained from the Markov property, by consid- 
ering the time reversal of S[Q,l — 1], and by translation invariance of simple 
random walk. The fact that for any hole H and w ^ H, there is a > 
such that P"'(5'[0,n] f] H = 0) K/logn, which follows from Lemma 2.3.1 
in [3], concludes the proof. □ 



Corollary 6.1. 

logE[7Yn(l)] ~ logn and logE[£„(l)] ~ logn. 



Proof. This follows immediately from Lemma 6.1 and the fact that 
there exists a constant K > such that E['H„(1)] < Knlog^n and E[£n(l)] < 
i^nlog^n, which is a direct consequence of Lemma 2.1. □ 

We now summarize what we believe should be the global picture, based 
on the work presented in this paper and our understanding of Mandelbrot's 
observation: The exponent for the expected number of (lattice) holes is 2, 
except for the case of small lattice holes, for which the exponent is 5/3. The 
exponent 5/3, however, only holds for lattice holes whose lattice areas grow 
more slowly than any power function. 
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Conjecture. There exist nondecreasing sequences Aq,Ai and A2, with 
1 < Ao(n) < Ai{n) < A2{n), where Ai{n) = o(logn) and logA2{n) ~ logn, 
such that 

f E[nn{A{n))] \ f fA{n 



and 



with: 



log 



E[nn{A'in))]J 2 ^yA'in) 
E[Cn{A{n))]\ rj/A{n 



i = 2 ifl<A{n),A'{n)<A2{n), 

7] = 2 if Ai{n)<A{n),A'{n)<A2{n), 

7?=f ifl<A{n),A'{n)<Ao{n). 



E[Cn{A'{n))]J 2\A'{n) 



APPENDIX: FROM SMALL TO LARGE BROWNIAN HOLES 

In [10], estimates for the expectation and the variance of \Urf\, the area of 
Ur^, are derived, where 

a, = {yeC:7r{Xr]f<\Ci{y)\<7rr]^}, 

A < 1, and Ci{y) is defined as in (2.2). In particular, it is shown that the 
variance is of smaller order of magnitude than the second moment. The two 
estimates which are relevant to us are the following: 

1. 

where 0{-) is for r/ ^ 0, but the implied constant may depend on A. 
2. There exists a constant K > such that for every rj £ (0, 1/4), 

(A.3) Var(|C/^|) </C|logr?|~ii/2^ 

If we let A(;^ = {y £ C : (n < |C„(y)| < c^n}, where c = 1 + e > 1, then the 
scaling property of Brownian motion allow us to deduce the following from 
(A. 2) and (A.3): 

_ 27rnlogc / / log | log(cC)| \ 

'^['^^']-i^?KAol' + ^l|iog(cC)|vO 

where 0{-) is for C ~^ 0, but the implied constant may depend on c, 
(A.4) Var(l^^l) < AV| log(cC)^^^/^ 
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for all C with cC G (0, 7r/16). 

We can now easily translate these facts into results about the number of 
components of area lying in a certain interval, rather than the total area 
covered by these components. This just requires dividing the total area by 
the area of a single component. Since for y G yl^, |C„(y)| can take any value 
in [i^n,cCn], we have an additional error term. 

If (5 > and m = [ ^^^°|" ], then for every j <m — 1, components of area in 

Ij^n [see (1.5)] have area less than n^~^/^ < j^n, so we can use (A. 4). With 
the definitions of (5.3) and (5.6), we have for < j < m — 1, 

log I log(c-'n~'^ 



(A,5) E[iV,„l=.„.^l-r. + .^|j^^|^^^_,^|,^, 

(A.6) Vai(A'j,„) < K^lJ log{ci,r']\-^'\ 

log I log(cy'n~'^)| 



nNj-J =7-^(1 + re + i 
Var(iV-J<Jf(7-j2|log(c'n 



|log(cJn-'5)|V2 

--5^1-3/2 



log(cJn-^)|V2 
Var(iV^J < /^(7i:^)'| log(c'n-^)r3/2, 
and for — l<j<m— 1, 

MlA'«H.-(l...^.0(M« 

Var(iV/t,) < i^(7i:^)'l log{c^ n^'r^/^ 

where \r\ < 2 and K and the constants of O may depend on e. 0{-) is for 
n — > 00. (5.8) and its analogues follow directly from the set of equations 
above and (5.4) is a consequence of a scaled version of the main result in 
[10]: If N{u) is the number of connected components of C \ B[0, 1] of area 
greater than u, then 

lim ii(logn)^A^(ii) = 27r a.s. 
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